Dynamic Cost-sensitive Naive Bayes Classification for Uncertain Data

نویسنده

  • Yuwen Huang
چکیده

The uncertain data as an important aspect of data mining, has received considerable attention, due to its importance in many applications, but little study has been paid to the cost-sensitive classification on uncertain data, so this paper proposes the dynamic costsensitive Naive Bayes classification for mining uncertain data (DCSUNB). Firstly, we apply the probability density to dispose uncertain discrete and continuous attributes, and give the cost-sensitive Naive Bayes classifier. Secondly, we propose the construction process of dynamic cost, and give the evaluation method for finding the optimal cost and the cost-sensitive classification with sequential test strategy. At last, the dynamic costsensitive Naive Bayes algorithm for uncertain data is structured, which searches the misclassification and test cost spaces to find the optimal cost. By comparing to the other cost-sensitive classification algorithms for uncertain data, the experiments on UCI Datasets show that DCSUNB can improve the classification performance, and reduce effectively the total cost.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cost-sensitive Naïve Bayes Classification of Uncertain Data

Data uncertainty is widespread in real-word applications. It has captured a lot of attention, but little job has been paid to the research of cost sensitive algorithm on uncertain data. The paper proposes a novel cost-sensitive Naïve Bayes algorithm CS-DTU for classifying and predicting uncertain datasets. In the paper, we apply probability and statistics theory on uncertain data model, define ...

متن کامل

A New Approach for Text Documents Classification with Invasive Weed Optimization and Naive Bayes Classifier

With the fast increase of the documents, using Text Document Classification (TDC) methods has become a crucial matter. This paper presented a hybrid model of Invasive Weed Optimization (IWO) and Naive Bayes (NB) classifier (IWO-NB) for Feature Selection (FS) in order to reduce the big size of features space in TDC. TDC includes different actions such as text processing, feature extraction, form...

متن کامل

‎A Bayesian mixture model‎ for classification of certain and uncertain data

‎There are different types of classification methods for classifying the certain data‎. ‎All the time the value of the variables is not certain and they may belong to the interval that is called uncertain data‎. ‎In recent years‎, ‎by assuming the distribution of the uncertain data is normal‎, ‎there are several estimation for the mean and variance of this distribution‎. ‎In this paper‎, ‎we co...

متن کامل

Hierarchical Naive Bayes Classifiers for uncertain data

In experimental sciences many classification problems deal with variables with replicated measurements. In this case the replicates are usually summarized by their mean or median. However, such choice does not consider the information about the uncertainty associated with the measurements, thus potentially leading to over or underestimate the probability associated to each classification. In th...

متن کامل

A Novel Bayesian Classification Technique for Uncertain Data

Data uncertainty can be caused by numerous factors such as measurement precision limitations, network latency, data staleness and sampling errors. When mining knowledge from emerging applications such as sensor networks or location based services, data uncertainty should be handled cautiously to avoid erroneous results. In this paper, we apply probabilistic and statistical theory on uncertain d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015